ENGINEER-LINGUISTIC PRINCIPLES OF ANALYSIS TEXTS
نویسندگان
چکیده
منابع مشابه
Automatic Analysis of Hungarian Texts and Linguistic Data
1. First of all I would like to give an account of the practical experience gained in the course of processing the about 60,000 or so entries of a Hungarian unilingual (explanatory) dictionary (.4 magyar nyelv 3rtelmez6 szdt,~ra, vv. I-VII, 1959-1962). In this case by " t e x t " we mean this non-natural corpus, that is the sum total of the entries of the dic• tionary; and by linguistic data th...
متن کاملA Linguistic Analysis of Simplified and Authentic Texts
The opinions of second language learning (L2) theorists and researchers are divided over whether to use authentic or simplified reading texts as the means of input for beginningand intermediate-level L2 learners. Advocates of both approaches cite the use of linguistic features, syntax, and discourse structures as important elements in support of their arguments, but there has been no conclusive...
متن کاملLinguistic Processing of Texts Using Geppetto
We describe the linguistic analyzer of a prototype for Information Extraction from texts. Such analyzer uses information derived from a shallow processor to limit the computational cost of the analysis. At the same time, shallow techniques are used to collapse parse fragments when a complete parse is not possible. The linguistic analyzer has been built using GePpeTto, an environment that allows...
متن کاملSome principles of linguistic methodology
Current difficulties in achieving intersubjective agreement in linguistics require attention to principles of methodology which consider sources of error and ways to eliminate them. The methodological assumptions and practices of various branches of linguistics are considered from the standpoint of the types of data gathered: texts, elicitations, intuitions and observations. Observations of the...
متن کاملLinguistic profiling of texts for the purpose of language verification
In order to control the quality of internet-based language corpora, we developed a method to verify automatically that texts are of (near-) native quality. For the LOCNESS and ICLE corpora, the method is rather successful in separating native and non-native learner texts. The Equal Error Rate is about 10%. However, for other domains, such as internet texts, separate classifiers have to be train...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Science-based technologies
سال: 2009
ISSN: 2310-5461,2075-0781
DOI: 10.18372/2310-5461.3.5130